The RWTH large vocabulary continuous speech recognition system

نویسندگان

Hermann Ney

Lutz Welling

Stefan Ortmanns

Klaus Beulen

Frank Wessel

چکیده

In this paper, we present an overview of the RWTH Aachen large vocabulary continuous speech recognizer. The recognizer is based on continuous density hidden Markov models and a time-synchronous left-to-right beam search strategy. Experimental results on the ARPA Wall Street Journal (WSJ) corpus verify the effects of several system components, namely linear discriminant analysis, vocal tract normalization, pronunciation lexicon and cross-word triphones, on the recognition performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...

متن کامل

The Rwth Speech Recognition System and Spoken Document Retrieval

متن کامل

The RWTH Aachen German and English LVCSR systems for IWSLT-2013

In this paper, German and English large vocabulary continuous speech recognition (LVCSR) systems developed by the RWTH Aachen University for the IWSLT-2013 evaluation campaign are presented. Good improvements are obtained with state-of-the-art monolingual and multilingual bottleneck features. In addition, an open vocabulary approach using morphemic sub-lexical units is investigated along with t...

متن کامل

Fast likelihood computation methods for continuous mixture densities in large vocabulary speech recognition

This paper studies algorithms for reducing the computational e ort of the mixture density calculations in HMM-based speech recognition systems. These likelihood calculations take about 70 85% of the total recognition time in the RWTH system for large vocabulary continuous speech recognition. To reduce the computational cost of the likelihood calculations, we investigate several space partitioni...

متن کامل

Speech Input Acoustic Analysis Phoneme Inventory Pronunciation Lexicon Language Model

This paper gives an overview of an architecture and search organization for large vocabulary, continuous speech recognition (LVCSR at RWTH). In the rst part of the paper, we describe the principle and architecture of a LVCSR system. In particular, the issues of modeling and search for phoneme based recognition are discussed. In the second part, we review the word conditioned lexical tree search...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1998

The RWTH large vocabulary continuous speech recognition system

نویسندگان

چکیده

منابع مشابه

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

The Rwth Speech Recognition System and Spoken Document Retrieval

The RWTH Aachen German and English LVCSR systems for IWSLT-2013

Fast likelihood computation methods for continuous mixture densities in large vocabulary speech recognition

Speech Input Acoustic Analysis Phoneme Inventory Pronunciation Lexicon Language Model

عنوان ژورنال:

اشتراک گذاری